pFind 2.0: a software package for peptide and protein identification via tandem mass spectrometry.

Authors

  • Le-Heng Wang
  • De-Quan Li
  • Yan Fu
  • Hai-Peng Wang
  • Jing-Fen Zhang
  • Zuo-Fei Yuan
  • Rui-Xiang Sun
  • Rong Zeng
  • Si-Min He
  • Wen Gao

Abstract

This paper describes the pFind 2.0 software package for peptide and protein identification via tandem mass spectrometry. First, the most important feature of pFind 2.0 is that it offers a modular, customizable platform on which third parties can test and compare their algorithms. Developers can create their own modules following the open application programming interface (API) standards and then add them to workflows in place of the default modules. In addition, to accommodate different requirements, the package provides four automated workflows that differ in their algorithm modules, execution processes, and result reports. Based on this design, pFind 2.0 provides an automated target-decoy database search strategy: the user simply specifies a false positive rate (FPR) and starts the search, and the system returns protein identification results automatically filtered at that estimated FPR. Second, pFind 2.0 is both accurate and fast. Many pragmatic preprocessing, peptide-scoring, validation, and protein inference algorithms have been incorporated. To speed up searching, a toolbox for indexing protein databases has been developed for high-throughput applications, and all modules are implemented under a new architecture designed for large-scale parallel and distributed searching. An experiment on a public dataset shows that pFind 2.0 can identify more peptides than SEQUEST and Mascot at the 1% FPR. It is also demonstrated that pFind 2.0 has better usability and higher speed than its previous versions. The software and more detailed supplementary information can both be accessed at http://pfind.ict.ac.cn/.
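
The target-decoy filtering step mentioned in the abstract can be illustrated with a minimal Python sketch. This is not pFind's code or API: the `PSM` record, the `filter_by_fpr` function, and the estimator (decoy matches divided by target matches above the score cutoff) are all assumptions made for illustration, and the abstract does not state which estimator pFind 2.0 actually uses.

```python
from typing import List, NamedTuple


class PSM(NamedTuple):
    """A peptide-spectrum match (hypothetical record, not pFind's format)."""
    spectrum_id: str
    peptide: str
    score: float     # assumption: a higher score means a better match
    is_decoy: bool   # True if the match came from the decoy database


def filter_by_fpr(psms: List[PSM], max_fpr: float) -> List[PSM]:
    """Keep target PSMs above the loosest score cutoff whose estimated
    FPR (decoys / targets, an assumed estimator) is at most max_fpr."""
    ranked = sorted(psms, key=lambda p: p.score, reverse=True)
    targets = 0
    decoys = 0
    best_cut = 0  # number of top-ranked PSMs that pass the cutoff
    for i, psm in enumerate(ranked, start=1):
        if psm.is_decoy:
            decoys += 1
        else:
            targets += 1
        # Remember the deepest rank at which the estimate is still acceptable.
        if targets > 0 and decoys / targets <= max_fpr:
            best_cut = i
    # Report only target matches; decoys serve purely as the error estimate.
    return [p for p in ranked[:best_cut] if not p.is_decoy]
```

Calling `filter_by_fpr(psms, 0.01)` on the merged results of a target and a decoy search would correspond to the 1% FPR setting quoted in the abstract, under the assumptions stated above. Tied scores are not treated specially in this sketch.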

Similar resources

pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry

SUMMARY Research in proteomics requires powerful database-searching software to automatically identify protein sequences in a complex protein mixture via tandem mass spectrometry. In this paper, we describe a novel database-searching software system called pFind (peptide/protein Finder), which employs an effective peptide-scoring algorithm that we reported earlier. The pFind server is implement...

Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry

MOTIVATION The correlation among fragment ions in a tandem mass spectrum is crucial in reducing stochastic mismatches for peptide identification by database searching. Until now, an efficient scoring algorithm that considers the correlative information in a tunable and comprehensive manner has been lacking. RESULTS This paper provides a promising approach to utilizing the correlative informat...

Speeding up tandem mass spectrometry based database searching by peptide and spectrum indexing.

Database searching is the technique of choice for shotgun proteomics, and to date much research effort has been spent on improving its effectiveness. However, database searching faces a serious challenge of efficiency, considering the large numbers of mass spectra and the ever fast increase in peptide databases resulting from genome translations, enzymatic digestions, and post-translational mod...

A Nonlinear Scoring Framework for Peptide Identification via Tandem Mass Spectrometry

The problem of false positives in peptide identification via tandem mass spectrometry (MS/MS) by database searching remains unsatisfactorily resolved in the current proteomics research. The correlative information among fragment ions in the MS/MS spectrum can be very helpful for reducing the number of false positives. However, due to the computational difficulty, existing peptide-scoring algori...

Improved proteomic analysis pipeline for LC-ETD-MS/MS using charge enhancing methods.

Electron transfer dissociation (ETD) is a useful and complementary activation method for peptide fragmentation in mass spectrometry. However, ETD spectra typically receive a relatively low score in the identifications of 2+ ions. To overcome this challenge, we, for the first time, systematically interrogated the benefits of combining ion charge enhancing methods (dimethylation, guanidination, m...

Journal:
  • Rapid communications in mass spectrometry : RCM

Volume 21, Issue 18

Pages: -

Publication date: 2007